Episodic Reinforcement Learning Control Approach for Biped Walking
Abstract
This paper presents a hybrid dynamic control approach to the realisation of biped walking in humanoid robots, focusing on policy gradient episodic reinforcement learning with fuzzy evaluative feedback. The proposed controller structure involves two feedback loops: a conventional computed torque controller and an episodic reinforcement learning controller. The reinforcement learning part includes fuzzy information about Zero-Moment Point (ZMP) errors. Simulation tests using a medium-size 36-DOF humanoid robot MEXONE were performed to demonstrate the effectiveness of the method.
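The two-loop structure described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names, the single-joint computed torque law with gains `kp` and `kd`, the linear correction term, the fuzzy membership breakpoints `mid` and `large`, and the Gaussian parameter-perturbation form of the episodic policy gradient update.

```python
import random


def fuzzy_zmp_reward(zmp_error, mid=0.02, large=0.05):
    """Map a ZMP error (metres, hypothetical scale) to a reward in [0, 1].

    Three assumed fuzzy sets (good, fair, bad) with piecewise-linear
    memberships that sum to 1, defuzzified as a weighted average.
    """
    e = abs(zmp_error)
    good = max(0.0, 1.0 - e / mid)
    bad = min(1.0, max(0.0, (e - mid) / (large - mid)))
    fair = 1.0 - good - bad
    return 1.0 * good + 0.5 * fair + 0.0 * bad


def computed_torque(q, qd, q_des, qd_des, qdd_des, inertia=1.0, kp=100.0, kd=20.0):
    """First loop: single-joint computed torque law (gravity and coupling omitted)."""
    return inertia * (qdd_des + kd * (qd_des - qd) + kp * (q_des - q))


def total_torque(tau_ct, theta, features):
    """Second loop: add a learned correction, here assumed linear in the features."""
    return tau_ct + sum(t * f for t, f in zip(theta, features))


def episodic_pg_update(theta, episode_return_fn, rng, sigma=0.1, alpha=0.05, baseline=0.0):
    """One episodic policy gradient step with Gaussian parameter exploration.

    The parameters are perturbed once per episode, the episode return is
    observed, and theta moves along (return - baseline) * eps / sigma**2,
    the likelihood-ratio gradient estimate for a Gaussian with mean theta.
    """
    eps = [rng.gauss(0.0, sigma) for _ in theta]
    trial = [t + e for t, e in zip(theta, eps)]
    ret = episode_return_fn(trial)
    grad = [(ret - baseline) * e / sigma ** 2 for e in eps]
    new_theta = [t + alpha * g for t, g in zip(theta, grad)]
    return new_theta, ret
```

In such a setup, `episode_return_fn` would run one walking episode in a simulator and return, for example, the mean of `fuzzy_zmp_reward` over the episode; the simulator itself is outside the scope of this sketch.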
Related papers
Dynamic Control Algorithm for Biped Walking Based on Policy Gradient Fuzzy Reinforcement Learning
This paper presents a novel dynamic control approach to acquiring biped walking in humanoid robots, focused on policy gradient reinforcement learning with fuzzy evaluative feedback. The proposed controller structure involves two feedback loops: a conventional computed torque controller, including an impact-force controller, and a reinforcement learning computed torque controller. Reinforcement learnin...
Poincaré-Map-Based Reinforcement Learning For Biped Walking
We propose a model-based reinforcement learning algorithm for biped walking in which the robot learns to appropriately modulate an observed walking pattern. Via-points are detected from the observed walking trajectories using the minimum jerk criterion. The learning algorithm modulates the via-points as control actions to improve walking trajectories. This decision is based on a learned model of...
Biped Balance Control by Reinforcement Learning
This work studied biped walking with single (one-leg) support and balance control using reinforcement learning. The proposed Q-learning algorithm lets a robot learn to walk without any prior knowledge of the dynamics model. This single-support balance control shifts the Zero Moment Point (ZMP) of the robot to a stable region over walking sequences by means of learned gestures. Hence, the p...
DOCTORAL THESIS PROPOSAL Biped Locomotion: Augmenting an Intuitive Control Algorithm with Learning
Foot placement is a key determinant for the stabilization of walking speed and lateral motion of a biped. However, there is no closed-form expression for the foot placement parameters in terms of the walking speed or other gait parameters. A simple and intuitive control algorithm (called “Turkey Walking”) based on Virtual Model Control (VMC) was successfully applied to planar bipedal walking. Ho...
A Simple Reinforcement Learning Algorithm For Biped Locomotion
We propose a model-based reinforcement learning algorithm for biped walking in which the robot learns to appropriately place the swing leg. This decision is based on a learned model of the Poincaré map of the periodic walking pattern. The model maps from a state at the middle of a step and the foot placement to the state at the middle of the next step. We also modify the desired walking cycle frequency bas...
Publication date: 2012